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ABSTRACT 



< 

Context. Stars form in dense, dusty clumps of molecular clouds, but little is known about their origin and evolution. In particular, the 
relationship between the mass distribution of these clumps (also known as the "clump mass function", or CMF) and the stellar initial 
I ■ mass function (IMF), is still poorly understood. 

Aims. In order to discern the "true" shape of the CMF and to better understand how the CMF may evolve toward the IMF, large 
samples of bona-fide pre- and proto-stellar clumps are required. The sensitive observations of the Herschel Space Observatory (HSO) 
are now allowing us to look at large clump populations in various clouds with different physical conditions. 

Methods. We analyse two fields in the Galactic plane mapped by HSO during its science demonstration phase, as part of the more 
complete and unbiased Herschel infrared GALactic Plane Survey (Hi-GAL). These fields undergo a source-extraction and flux- 
estimation pipeline, which allows us to obtain a sample with thousands of clumps. Starless and proto-stellar clumps are separated 
using both color and positional criteria to find those coincident with MIPS 24 /im sources. 

Results. We describe the probability density functions of the power-law and lognormal models that are used to fit the CMFs, and 
we then find their best-fit parameters. For the lognormal model we apply several statistical techniques to the data and compare their 
results. 

Conclusions. The CMFs of the two SDP fields show very similar shapes, but very different mass scales. This similarity is confirmed 
by the values of the best-fit parameters of either the power-law or lognormal model. The power-law model leads to almost identical 
CMF slopes, whereas the lognormal model shows that the CMFs have similar widths. The similar CMF shape but different mass scale 
represents an evidence that the overall process of star formation in the two regions is very different. When comparing with the IMF, 
we find that the width of the IMF is narrower than the measured widths of the CMF in the two SDP fields. This may suggest that an 
additional mass selection occurs in later stages of gravitational collapse. 

Key words, stars: formation - Stars: pre-main sequence - ISM: clouds - ISM: structure 

X: 

03\ 1. Introduction Starless (or pre- stellar, if gravitationally bound) clumps rep- 

resent a very early stage of the star formation (SF) process, 
before collapse results in the formation of a central protostar, 

c . e •■ j»i fiiiju ,r»i and the physical properties of these clumps can reveal important 

Stars form in dense, dusty clumps of molecular clouds, but little , , , . , ,., ' . 

• , , ... • • • 1 , .• , .• ... clues about their nature: mass, spatial disttibutions and lifetime 

is known about their origin and evolution (sometimes the term . ,. , 
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, . j- . -1 ca. i /ii mg to the formation of the clumps from the parent molecular 

ship between the mass distribution of these clumps (also known , , T .... . „ , „, ,„ , T , „„ 
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f , , • . e ■ u u e tu j-fc Therefore, large samples of bona-fide starless clumps are lmpor- 
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• , .. ,,, „ ■ ■■■ , , e . , tant for comparison of observations with various SF models and 
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and proto-stellar clumps from an otherwise unremarkable col- 
lection of high column density features. 
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Previous stu dies from data s ets ob t ained with ground 
facilities (e.g.. iTesti & Sargent! [l998l iMotte et all f l998. 
Nutter & Ward-Thompsonll2006l lEnoch et al.ll2008l lAlves et alj 
20081 and ISadavov et alJ 120101) have revealed that CMFs can 
roughly follow either power-law or lognormal shapes, which 
in some cases closely resemble the observed stellar IMF. 
Unfortunately, these works have als o emphasized the difficult y 
in discerning the form of the CMF (ISwift & Beaum onll2010l) . 
In fact, in some cases relatively small regions within larger 
clouds were examined or, even when the observations produced 
surveys over larger areas, they were carried out at a single 
wavelength (e.g., 850/im, 1.1mm) and used different set of 
conditions to identify "clumps" in molecular clouds. 

This scenario changed recently thanks to submillimeter con- 
tinuum surveys based on telescopes placed on (sub)orbital plat- 
forms, namely the Balloon-borne Large Aperture Submillimeter 
Telescope (BLAST; iPascale et al.ll2()08) and the Herschel Space 
Observatory (HSO). BLAST carried out simultaneous observa- 
tions at A = 250, 35 0, and 500 um of several Galactic star form- 
ing regions (SFRs: IChapin et al l 120081 iNetterfield et"al] 120091 
Olmi et al. 20091 IRov et al.ll2.Ql ll) . The even more sensitive and 
higher-angular resolutions observations of HSO are now allow- 
ing us to look at large clump populations in various clouds with 
different physical conditions, while using a self-consistent anal- 
ysis to derive their physical parameters. For example, the first re- 
sults from the Herschel Gould Belt Survey confirm t hat the shape 
of the pre-stellar CMF res embles the stellar IMF dAndre et al.l 
l20irilKonvvesetal.ll2010h . 

In this first paper we analyse two fields in the Galactic plane 
mapped by HSO during its science demonstration phase (SDP). 
The two fields observed represent a sample of the more complete 
and unbiased Herschel infrared GALactic Plane Survey (Hi- 
GAL). Hi-GAL is a key program of HSO to carry out a 5-band 
photometric imaging survey at 70, 160, 250, 350, and 500 pm of 
a \b\ < l°-wide strip of the Milky Way Galactic p lane, originally 
plann ed for the longitude range -60° < / < 60° ( Molinari et al.1 
2010b), and then extended in subsequent proposals to the whole 
Galactic plane. 

The two SDP fields have been thoroughly analyzed and have 
also been used to test various methods of source extraction. In 
addition, they have different global properties that make them 
interesting for the purposes of this work. Here we use these two 
regions as a test bed for methods of analysis that will be later 
applied to the rest of the Hi-GAL survey. Therefore, the conclu- 
sions of this work should be considered preliminary and specific 
for the SDP fields. 

The outline of the paper is the following: in Section [2] we 
give a general description of the Hi-GAL data. In Section [3] 
we describe the source extraction technique and the photom- 
etry specifically adopted in this work, while we describe how 
the spectral energy distributions (SEDs) were assembled in 
Section |4] The statistical analysis of the CMFs is carried out in 
Sections [5] and [6] We discuss our results in Section Q and draw 
our conclusions in Section [8] 

2. Observations 

The observations were carried out by HSO during the SDP that 
took place in November 2009. Five waveba nds were sim ultane- 
ously observed: the SPIRE instrument (Griffin et al. 2010) at A = 
250, 3 50, and 500 fxm, and the PACS instrument (Poglitsch lstall 
2010) at A =70 and 160 pm, were used (see Tab.[T]l. The two ob- 
served fields were centered at I - 30° and / = 59° and the final 
maps spanned 2° in both Galactic longitude and latitude. 



Table 1. PACS/SPIRE wavebands and beam FWHM. The mea- 
sured beam FWHM for PACS at 70 pm is much larger than its 
nominal diffraction FWHM. 



Instrument 


Band 
[pm] 


Beam FWHM 
[arcsec] 


PACS 


70 


9.2 


PACS 


160 


12.0 


SPIRE 


250 


17.0 


SPIRE 


350 


24.0 


SPIRE 


500 


35.0 



The detailed description of the observation settings and scan- 
ning strategy ad opted as well as the map generation proce- 
dure is given in Molin ari et al.l d2010a|lbh. Im ages of the SDP 
fields can be found in [Molinari et al.l d2010al) and an analysis 
of various general properties of these re gion s can be found i n 
iBattersbv etaD (l201ll) . lBillot etalJ (l2010h and lElia et all (l2010h . 
Here we summarize the most relevant SFRs known in both re- 
gions. 

2.1. The I = 30° region 

The I = 30° region that has been analyzed is approximately 
4 deg 2 in size and is do minated b y the SRBY 162 (named af- 
ter lSolomon et al.ll 19871 see al so Moon ev et alJ 19951 also called 
W43-main. see lNguven Luong et alJl201 lh and SRBY 171 (also 
called W43-south) SFRs, with a total mass of several times 
1O 6 M . The clump of W43-main harbors a well-known gi- 
ant HII region p owered by a very luminous (~ 3.5 x 10 6 L Q , 
Blum et akl l 19991 and references therein) cluster of Wolf-Rayet 
and OB stars. W43-south corresponds to a less extreme cloud, 
which also harbors a smaller HII region, the well-kno wn ultra- 
compact HII region G29.96-0.02 dCesaroni et al.lll998l) . 

Recent analysis of the W43-main region, also in the con- 
text of the Hi-GAL project, revealed a complex structure that 
could be resolved into a dense cluster of protostars, infrared dark 
clouds, and ridges of warm dust heated by high-mass stars, thus 
confir ming its efficiency in forming massive stars dBallv et al.l 
l2010t) . Whi le the two SDP fie l ds seem to have similar clustering 
properties dBillot et alj|201 ll) . IBattersbv et alj d201 lh show that 
the median temperatures and the column densities, for all the 
pixels in the source masks consider ed, are higher in the £ = 30° 
field than in I = 59°. In addition. IBattersbv et ail (1201 ll) also 
speculate that the fact that the fraction of pixels with absorption 
at 8 pm in the i - 59° field is so much lower than that in the 
I = 30° field, could suggest that there is a lower fraction of cold, 
high-column density clouds in the I = 59° field. 

Finally, the richness of the I = 30° region in young mas- 
sive stars has also been associated with it being approximately 
located at the interaction regio n between one end of the Galactic 
bar and the Scutum spiral arm dGarzon et al.lll997h . 

2.2. The I = 59° region 

Contrary to the t = 30° region, the i - 59° field is not lo- 
cated at the tip of the Galactic bar, but belongs instead to the 
Sagittarius spiral arm. This region covers approximately 5 deg 2 
and it is prominent in images of therma l dust emission as w ell 
as in the radio an d the optical (see, e.g.. lChapin et al.ll2008l and 
iBillot et al.ll2010h . The most active SFR is the Vulpecula OB 
association which hosts the star cluster NGC 6823 and three 
bright Hn regions, Sh2-86, 87 and 88. The stellar HR diagram 
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for NGC 6823 has been examined bv lMassev etail (1 19951) . They 
find an age of 5-7 Myr for the bulk of the stars. 

The far-infrared (60 and lOOjum) emission in this region 
is dominated by several l uminous high -mass SFRs. Three of 
these have been studied by Beltran "et al.l d2006l) and lZhang et alJ 
d2005l) and are associated with the IRAS sources 19368+2239, 
19374+2352, and 19388+2357. A further four IRAS regions 
(19403+2258, 19410+2336, 1 9411+2306, and 194 13+2332) 
have been studied extensively by Beut her et al.l d2002i) . using CS 
multi-line, multi-isotopologue observations and the 1 .2 mm dust 
continuum. 



3. Source Photometry and identification of compact 
sources 

There is no standard terminology to identify the compact sources 
extracted in bolometer maps by various existing algorithms. 
However, while "core" usually refers to a smaller-scale object 
(^ 0. 1 pc), possibly corresponding to a later stage of fragmenta- 
tion, the term "clump" is generally used for a somewhat larger 
(^ 1 pc), unresolved object, possibly composed of several cores 
dWilliams et al.ll2000l) . Our maps are likely a collection of both 
cores and clumps; however, given the di stances of the two fi elds 
being analyzed here (see Section 141 and iRusseil et alJl201 ll) . we 
think the term clump is more appropriate to refer to the compact 
objects extracted in the SDP fields. We also note that since we 
are constructing the clump (and not core) mass functions, with 
the clumps likely being composed of smaller fragments, no at- 
tempt will be made here to separate the gravitationally bound 
and unbound sources. 

3.1. Source and flux extraction in the SPIRE/PACS maps 

As we mentioned in the introduction, the SDP fields have been 
useful test beds for various methods of source extraction and 
brightness estimation. However, for the purposes of this work 
it was also necessary to adopt a method that in the end would 
be able to determine source masses, and thus the CMF, with a 
better accuracy compared to the original sou rce extract i on and 
brigh tness estimation pip eline described by lElia et all (2010) 
and iMolinari et alJ (1201 ll) . This is achieved in two ways: first, 
the method outlined here defines in a consistent manner the 
region of emission of the same volume of gas/dust at differ- 
ent wavelengths, thus differing from th e source group i ng and 
band-mer ging proce dures described by Molin ari et alJ d201 lb 
and lElia et al.l (120101) . In addition, the SED fi tting procedure is 
more accurate compared to that described by lElia et al.l {2010) 
(see Sectional. 

The source extraction and brightness estimation techniques 
applied to the Hi-GAL maps in this work are similar to the meth- 
ods used during anal ysis of the BLAST05 ( Chapin et al. 2008) 
and BLAST06 data jNetterfield etd] 120091 lOlmi et all 120091 
However, important modifications have been applied to adapt the 
technique to the SPIRE/PACS maps, as described below. 

Candidate sources are identified by finding peaks after a 
Mexi can Hat Wavelet typ e convolution (MHW, hereafter; see, 
e.g., iBarnard et ail 120041) is applied to all five SPIRE/PACS 
maps. Initial candidate lists from 70, 160 and 250^/m are then 
found and fluxes at all three bands extracted by fitting a com- 
pact Gaussian profile to the source. Sources are not identified 
at 350 and 500 /urn due to the greater source-source and source- 
background confusion resulting from the lower resolution, and 
also because these two SPIRE wavebands are in general more 



distant from the peak of the source SED. The Gaussian-fitted 
sources are then first selected based on their integrated flux 
(which cannot be lower than assigned values in each waveband) 
and also on their FWHM, which is allowed to vary from 80% of 
the beam (to allow for pixelization effects on point sources) to 
90 arc sec. 

Each temporary source list at 70, 160 and 250//m is then 
purged of overlapping sources, by comparing the positions of 
nearby sources at a given wavelength using their estimated 
FWHM: if two sources are nearer than one-half of the sum of 
their respective FWHMs, they are taken as being the same ob- 
ject. Purged lists at 70 and 160 jum are then merged, and nearby 
70 and 160 /vm positions are treated as before. This merged 
70/160yum catalog is then compared with the (already purged) 
250 jum source list, and the same procedure is repeated for iden- 
tifying overlapping objects and merge the two catalogs. Thus, 
a final source catalog is generated that contains well separated 
objects detected from all three 70, 160 and 250 /vm wavebands. 

In the next stage, Gaussian profiles are fitted again to all 
SPIRE/PACS maps, including the 350 and 500 yum wavebands, 
using the size and location parameters determined at the shorter 
wavelengths during the previous steps (the size of the Gaussian 
is convolved to account for the differing beam sizes). The center 
of the new Gaussian fit is allowed to move at most by ^ 5 arcsec 
relative to the candidate source location, to allow for morpho- 
logical differences at 350 and 500 fim, and for fitting one single 
Gaussian profile at those locations where more than one candi- 
date source had been identified during the previous steps. Then, 
before writing the final catalog, sources are again selected using 
their final FWHM. Using this technique, 4702 and 2003 selected 
compact clumps were identified in the I = 59° and ( = 30° 
fields, respectively. 

Monte Carlo simulations are then used to determine the com- 
pleteness of thi s proce ss. Following the method outlined by 
iNetterfield et al.l d2009h . fake sources are added to the 160 and 
250 /urn maps and are then processed through the same source 
extraction pipeline. To generate these fake sources, we randomly 
select a fraction (~ 20 - 30%) of the sources in the final cata- 
log, we convolve them with the measured beam in each band and 
insert them back into the original maps. The locations of these 
new sources are chosen to be at least 2 arcmin from their origi- 
nal location, but not more than 4 arcmin, so that the fake sources 
will reside in a similar background environment to their original 
location. These added sources are not allowed to overlap each 
other, but are not prevented from overlapping sources originally 
present in the map so that the simulation will account for errors 
due to confusion. Therefore, the sample of fake sources approx- 
imately reproduces the distributions in intensity and size of the 
original catalog. 

The resulting set of maps, with both original and fake 
sources, is run through the source extraction pipeline and the ex- 
tracted source parameters are compared to the simulation input. 
The simulations are performed using smaller (^ 0.2 deg 2 ) maps, 
extracted from the original maps of the I = 59° and i — 30° 
fields (containing a few hundreds sources), in order to be able to 
run the source extraction pipeline multiple times, thus achieving 
a statistical average for the mass completeness limits (estimated 
from the 160yum maps, at the 80% confidence level), which are 
listed in Table [2] These values have been estimated for the me- 
dian distances of each field, also listed in the same table, and 
typical values of T — 20 K and j3 — 2 (the dust emissivity index, 
see Section |4| have been used to convert the flux completeness 
limit into a mass completeness limit. 
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Table 2. Median values of mass, temperature, distance and completeness limits toward the £ = 30° and I = 30° fields. 



Population 




e = 


30° field 






e = 


59° field 






Temperature 


Mass 


Distance 


Completeness 


Temperature 


Mass 


Distance 


Completeness 




[K] 


[M ] 


[kpc] 


[M Q ] 


[K] 


[M ] 


[kpc] 


[M Q ] 


All 


21.8 


99.6 


7.6 


73.0 


17.6 


2.1 


3.6 


0.7 


Starless 


19.7 


112.6 






17.4 


2.2 






Proto-stellar 


24.3 


89.9 






19.8 


1.9 







3.2. Separating starless and proto-stellar clumps 
3.2.1. Color criteria 

The catalogs of sources compiled with the technique described 
in {sl3.ll do not attempt to separate starless and proto-stellar 
clumps. These populations must be separated using indepen- 
dent methods, to ensure that they can be accurately character- 
ized. Various criteria can be found in t he literature (see e.g., 
De Luca et all I20071 lEnoch et all I20091 iNetterfield et aD [2009L 
Olmi et al.ll2009l) . that use both proximity criteria to infrared ob- 
jects and temperature criteria. The different approaches may dif- 
fer in their operational definition of the positional criteria and the 
identification of proto-stellar phases based on SED shape and/or 
other color criteria. 

In this work we combine color and positional criteria. This 
approach has the advantage of being less computationally ex- 
pensive, and least biased to particular models or model parame- 
ters, a s compared to a full SED approach (e.g., Robitaill e et alj 
2006). Given the range of spatial resolutions and spectral cover- 
age within the SPIRE/PACS wavebands, this combined approach 
should also be least biased as compared to positional criteria ap- 
plied to data sets with single resolutions and limited spectral cov- 
erage. 

Young protostars still embedded in dense clumps should 
peak in the far-infrared due to the absorption and reprocess- 
ing of light to longer wavelengths by envelopes. Several recent 
studies ha ve identified these sources using IRAC and MIPS col - 
ors (e.g. JHarvev et al.ll2OO6llj0rgensen et al.ll2006ll2007l l2008). 
Here, ho wever, we also i nclud e the somewhat more restrictive 
criteria of Sadavov et alJ d2010h . In fact, since embedded proto- 
stars peak in the far-infrared, we also require that proto-stellar 
clumps have strong detections at 24 and 70 fim and rising-red 
colors. The red colors will exclude stellar sources, which have 
fiat colors in the infrared regime, but not extragalactic sources. 
Extragalactic contamination must be e xcluded separately , for ex - 
ample following the color criteria of iGutermuth et afl ([2008). 
IRAC and MIPS colors were assigned to the Hi- GAL sources 
using the MIPSGAL catalog dShenov et alj|2012l) and accord- 
ing to positional criteria (see Section [3. 2. 2\ . Our color criteria 
for identifying proto -stellar objects are thus as follows (see also 
ISadavov et aljlioiOh : 

- (A) The source flux at 24//m has a signal-to-noise ratio 
(S/N)> 3 and the source 70 //m flux is higher than 0. 1 Jy. 

- (B) Source colors are dissimilar t o those of star-forming 
galaxies (see IGutermuth et al]|2008l) . i.e., 

[4.5] -[5.8] < ^ ([5.8] - [8.0] - 1), and 

[4.5] -[5.8] > 1.05, and 
[5.8] -[8.0] > 1. 

- (CI) If the source is detected at 24 /im then, 
[8.0] - [24] > 2.25, and 

[3.6] -[5.8] > -0.28 ([8.0]- [24]) + 1.88 



- (C2) If the source is not detected at 24 /mi (i.e., due to the 
lower sensitivity), then the IRAC bands are used, 

[3.6] -[5.8] > 1.25, and 
[4.5] -[8.0] > 1.4. 

It should be noted, however, that sources with no 24 /mi detec- 
tion, and possibly even those with no 70 /mi counterpart, can- 
not be defi nitively classified as starless, at least in a low-mass 
clump (see lChen et al. 2010). These kind of sources may in fact 
represent a stage intermediate between a gravitationally-bound 
starless clump (i.e., pre-stellar) and a Class protostar, and are 
difficult to identify. However, apart from these elusive objects, 
our color-criteria should be able to efficiently separate starless 
and proto-stellar clumps on a statistical basis. Separating grav- 
itationally bound and unbound clumps is beyond the scopes of 
this work. 

3.2.2. Positional criteria 

The positional criteria to associate a MIPSGAL counterpart to 
the Hi-GAL sources could make the usual assumption that a 
given clump is to be considered proto-stellar only if a young 
stellar object (YSO) candidate is found in the region of a clump 
where the intensity of emission is higher, which is generally as- 
sociated with the peak submillimeter flux. However, this simple 
scenario may become more complicated in confused regions, 
where several clumps are blended together, at which point the 
peak value could be off-center with respect to the volume occu- 
pied by the individual clumps. 

The search for YSO candidates near a given submillimeter 
clump is further complicated by the fact that clumps can be ir- 
regular in shape. In these cases, a circular approximation of the 
clump extent, by using the radius obtained with a 2-D Gaussian 
fit to the clump intensity distribution, could lead the search al- 
gorithm to actually probe regions beyond the "real" boundaries 
of the clump. On the other hand, a fixed angular tolerance could 
clearly lead to either underestimate or overestimate the number 
of YSO counterparts to the submillimeter clumps. 

An alternative algorithm to ensure that the observed size 
and shape of t he cl ump is considered, has been suggested by 
ISadavov et al .1 (1201 Oh . in which the object location is compared 
to a percentage of the difference bet ween the peak submill imeter 
intensity and the boundary intensity. ISadavov et alJ (120101) use a 
constant value for the boundary intensity, but in the case of the 
SPIRE/PACS maps, where the clump shape and intensity has to 
be defined with respect to the local background, we used a dif- 
ferent approach. 

First, a 1 arcmin box is extracted from the PACS 160 //m flux 
density map, centred around each catalog clump. To the purpose 
of this procedure, we think the 160 /mi map constitutes the best 
trade-off between sensitivity and angular resolution. An average, 
local background level is then estimated and subtracted from the 
map. Intensity at the nominal position of the clump is evaluated 
in this background-subtracted map; the algorithm also examines 
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10 -10 -20 

Arc Seconds 
Longitude 29.67 Latitude -0-23 




0-0 20 10 -10 -20 

Arc Seconds 
Center: Longitude 30-17 Lotitude 0.49 

Fig. 1. Top. Example of a Hi-GAL starless clump (nominal posi- 
tion represented with a yellow "+" sign) with an irregular shape 
(shown by the black area, against the gray-scale background, 
representing the 160 jum emission). No MIPSGAL or MHW24 
source (shown with the "★" and "diamond" symbols, respec- 
tively) is falling within the black area and thus they cannot be 
associated with the Hi-GAL source. Bottom. Example of a proto- 
stellar Hi-GAL clump. One MHW24 source is associated with 
the submillimeter clump, but no MIPSGAL source satisfies the 
coincidence and color criteria. 



the nearby pixels in case the (local) peak submillimeter intensity 
is off-center from the clump nominal location. The search area 
for coincidence with YSO candidates is then defined as all those 
pixels interior to the contour corresponding to a fraction / = 0.7 
of the peak intensity value found in the previous step (see Fig.Q]). 

The fraction / = 0.7 has been determined by trial-and- 
error, and thus there were always several cases where identifying 
unique YSO candidates within the specified contour was not al- 
ways clear, particularly in crowded regions or when a dim clump 
was found near a much more intense source. In these cases, 
the search area could result in an elongated or quite irregular 
shape, and we thus introduced further constraints; for example, 
the YSO candidate must be located within a maximum angular 



distance from the center of the clump, which is a function of the 
clump FWHM. We note that the arbitrarity of the / = 0.7 value 
may lead to some cross-contamination of the starless and proto- 
stellar samples, but it should be statistically comparable in the 
two SDP fields. 

The presence of a Mid-Infrared source near a Hi-GAL clump 
was chec ked in two different w ays. First, the MIPSGAL catalog 
was used (She nov et alj|2012l) . In order to complement this cat- 
alog, we also decided to apply the source extraction algorithm 
described in CTTI to the mosaicked 24 ^m MIPS maps (MIPS24, 
hereafter) of the ( = 59° and I = 30° fields (hereafter, we will re- 
fer to these MIPS24 sources as MHW24 objects). This approach 
had the advantage of being able to detect MIPS24 sources that 
could have been missed by the point response function fitting 
applied by the MIPSGAL team dCarev et al.ll2009l) . The applica- 
tion of the MHW method also ensured that MIPS24 sources were 
extracted in a similar and uniform fashion to the SPIRE/PACS 
maps. 

Thus, for each Hi-GAL clump, once the contour defined 
by / = 0.7 was determined, the presence of both MIPSGAL 
and MHW24 sources within this contour was checked. If a 
MIPSGAL source satisfies the coincidence criterion, and if in 
addition its colors satisfy the criteria described in 33.2.11 then 
this object is assumed to be an embedded YSO and its associated 
Hi-GAL clump is considered to be proto-stellar. If a MHW24 
source source satisfies the coincidence criterion, and if it also 
satisfies the color criterion (A) above, then this object is equally 
assumed to be an embedded YSO and its associated Hi-GAL 
clump proto-stellar. Thus, if either a MIPSGAL or MHW24 
source satisfies the color and coincidence critaria, then the as- 
sociated Hi-GAL clump is considered proto-stellar. 

4. Submillimeter-MIR SEDs 



As discussed bv lOlmi et al. (2009), our goal here is to use a sim- 
ple, single-temperature SED model to fit the sparsely sampled 
photometry described in 33.11 which will allow us to infer the 
main physical parameters of each clump: mass, temperature and 
luminosity. These quantities must be interpreted as a parameteri- 
zation of a more complex distribution of temperature and density 
in the clump and the equally complex response of each instru- 
ment to these physical conditions. 

In this work we mainly follow the method described by Olmi 
et al. (2009, and references therein). Contrary to these authors, 
however, we do not assume optically-thin emission and use a 
general isothermal m odified blackbody (or gray-body) emission 
of the type (see, e.g jMezger et al.lrl990h . 



S v = £l s B v (T) [1 -exp(-T d )], 

where rj, the dust optical depth, can be written as: 



A (v 
T d = — — 

lV 



-Q, 



(1) 



(2) 



where A is a constant, B V (T) is the Planck function, B is the dust 
emissivity index, O s is the source angular diameter (evaluated at 
160 /mi) and the emissivity factor is normalized at a fixed fre- 
quency vo. We then write the factor A in terms of a total (gas 
+ dust) clump mass, M, the dust mass absorption coefficient kq 
(evaluated at Vq), and the distance to the object, d: 



Mk q 
R„ d d 2 ' 



(3) 
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Fig. 2. Top. SED of a source in the I = 30° field. The upper 
limit at 70 yum is shown and black line shows the best-fit mod- 
ified blackbody, whereas gray lines show the 68% confidence 
envelope of modified blackbody models from Monte Carlo sim- 
ulations. Bottom. Another Hi-GAL clump with a more peculiar 
shape of the SED, due to the use of equation (Q]) instead of the 
optically-thin approximation. 



The di stance to individual clumps was taken from lRusseil et alJ 
(1201 ll) when available and otherwise set to the median value (see 
Table [2]). Since kq refers to a dust mass, the gas-to-dust mass 
ratio, /? g d, is required in the denominator to infer total masses. 
We adopt kq - 1 1 cm 2 g" 1 , e valuated at vo = c/250/mi, and 
R gd =s 100 riMartinetal.1120121) . 

Equation © is fit to all of the five SPIRE/PACS fluxes (with 
the PACS 70 yt/m flux used as upper-limit, see below), using x 2 
optimization. Color correction of the SPIRE/PACS flux densities 
is performed using the filter profiles, and color-corrected fluxes 
are thus used in all subsequent applications. Besides to A and T, 
also B and Q s are allowed to vary during the x 2 optimization, for 
a total of four free parameters. 

As far as the parameter £2 S is concerned, this method will 
generally result in a different value of the source size as com- 



pared to that obtained during the source extraction procedure 
described in Section 13.11 This is because the source extraction 
procedure yields a source size mainly based on the source shape; 
however, the source size delivered by the x 1 optimization of the 
SED depends solely on the source photometry. In general, we 
find that the latter method tends to underestimate the source size 
obtained during the source extraction procedure. 

Uncertainties for all model parameters are o btained from 
Monte Carlo simulations (see IChapin et al.ll2008l) . Mock data 
sets are generated from realizations of Gaussian noise. The x 2 
minimization process is repeated for each data set, and the result- 
ing parameters are placed in histograms. Means and 68% confi- 
dence intervals are then measured from the relevant histograms. 
Fig. [2] shows example SEDs obtained with this procedure; the 
black line consists of the best-fit modified blackbody at wave- 
lengths > 70yum and the solid gray lines indicate the 68% con- 
fidence envelope of modified blackbodies that fit the SPIRE and 
PACS data. 

The 70 ,um flux is always treated as an upper-limit, because 
it would otherwise cause systematic deviations from a single- 
temperature gray-body fit. In fact, a non-negligible fraction of 
the flux at this wavelength is emitted by the warmer proto-stellar 
object, either formed or in advanced stage of formation at the 
center of the clump, while most of the emissi on at A > 160/zm 
originates in the colder envelope of the clump (lElia et al.l l2010). 
We adopt "survival analysis" to properly in clude the upper-limi t 
in the calculation ofy 2 (see the discussion in lChapin et alj2 008). 

5. Description of models used to fit the CMF 

The goals of our analysis are to find the best fit parameters 
for some selected models given the data. In the following sub- 
sections we give a general description of the mathematical func- 
tions used in this analysis, whereas in the Appendices we outline 
the details of the numerical implementation, where all proce- 
dures were written in the Interactive Data Languag^J 



5.1. Definitions 

For the sake of mathematical convenience we will approximate 
discrete power-law and lognormal behavior with their continu- 
ous counterparts. Therefore, if dN represents the number of ob- 
jects of mass M lying between M and M + dM, the number den- 
sity distribution per mass interval (or CMF), $ (M ) = dN /dM, is 
defined through the relation (e.g., Chabrier 2003): 



d^ = f 0ogM) 
% dM MlnlO 



1 



dN 



M In 10/ dlogM 



(4) 



thus, £(M)dM represents the number of objects with mass M 
lying in the interval [M, M + dM]. The probability of a mass 
falling in the interval [M, M + dM] can be written for a continu- 
ous distribution as p(M)dM, where p(M) represents the mass 
probability density function (PDF). The PDF and CMF must 
obey the following normalization conditions: 



r 

Jm 

j 

-f 



Mm 



p(M)dM = 1 and 



%{M)dM = 



log(M sup ) 



£(logAf)d log M = iV tc 



(5) 



log(M inr ) 



IDL; http://www.ittvis.com/ProductServices/IDL.aspx 
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where N lot is a normalization constant which, for the case of dis- 
crete data, can be interpreted as the total number of objects being 
considered in the sample. From Eq. (O p(M) can also be written 
as p(M) = £(M)/jV tot . 

Mi n f and M sup denote respectively the inferior and superior 
limits of the mass range for the objects in the sample, beyond 
which the distribution does not follow the behavior specified 
by the PDF or CMF For example, the power-law density (see 
Section lB. 1 . It diverges as M — > so its formal distribution can- 
not hold for all M > 0; there must be some lower bound to the 
power-law behavior, which we denote by M m f . More in general, 
Mi„f and M sup should give us a more quantitative estimate of the 
mass range where the assigned PDF gives a better description of 
the data. 

In the following, we will also make use of the complemen- 
tary cumulative distribution function (CCDF), which we denote 
P C {M) and which is defined as the probability of the mass to fall 
in the interval |m, M sup ], i.e.: 



P C (M) 



JM 



p(M')dM' 



(6) 



5.1 .1 . Powerlaw form 

The most widely used functional form for the CMF is the power- 
law: 



£ pw (log M) 
£ P w(M) 



A pw M- a , 



or 



In 10 



(7) 
(8) 



where A pw is the normalizati on constant. T he original Salpeter 
value for the IMF is a = 1 .35 (lSalpeterlll955l) . 

The PDF of a power-la w (continuous) distribution is given 
by (e.g JClauset et al.ll2009l) : 



p p „(M) = C pw M 



(9) 



where the normalization constant can be determined by applying 
the condition in Eq. (0, yielding: 



lognormal PDF in density. A CMF consistent with a lognormal 
form has also been ob served in recent surveys of nearby SFRs 
(see, e.g., Enoch 2006). The continuous lognormal CMF can be 
written (e.g., Chabri erS2003h : 



ft„(ln M) 



V27TC 



exp 



(In M-p) 2 



2cr 2 



(ID 



where p and <x 2 = ((InM - (InM)) 2 ) denote respectively the 
mean mass and the variance in units of In M. A\ a represents a 
normalization constant which is evaluated in AppendixlAl 

The PDF of a continuous lognormal distribution can be writ- 
ten as (e.g JClauset et alJl2009l) : 



P\niM) = — exp 



(InM - p) 2 



2cr 2 



(12) 



Cln r 21 



Again, by applying the normalization condition, Eq. (|5j, we find: 



C in = **[~^Pi X [ erfc (*inf) ~ erfc(x sup )] 



(13) 



where the variables x, Xi n f and jf sup are defined in AppendixlAl 
and we note that the parameters M m f and M sup are not necessarily 
the same as those determined for the power-law distribution. As 
we already mentioned in Section IB. 1.1 1 if the condition M sup » 
M; n f holds, then we can write: 



Cin - \ — t x [erfc(jc inf )Y 
ncr L 



(14) 



Then, by using the definition of PDF given in Section lBTTI and 
the relations for the erf and erfc functions shown in AppendixlAl 
the CCDF for the lognormal distribution, P[ n (M), can be written 

as: 



f' n (M) 



erfc(x) - erfc(x sup ) 
erfc(xjnf) - erfc(x sup ) 



(15) 



Cpw (Mr- - M-«) 



(10) 



For a > and M S UD » M m f one can use the approximation 
C pw ^ aM^ n{ as in lClauset etaD (2009) and lSwift & Beaumontl 
(2010) (when the proper adjustments for the different definition 
of the power-law exponent in Eq. (|8] ) are done). 

According to Elmegreen (1985) power-laws tend to arise 
when only the fragments can fragment, and lognormals arise 
when both the fragments and the interfragment gas can frag- 
ment during a hierarchical process of star formation. However, 
the power-law functional form is also widely used because 
of its versatility. Past surveys of SFRs (see for example 
ISwift & Beaumontl 1201 Ol and references therein) have shown a 
variety of values for a, and the same dataset can be typically 
fit by one or more power-laws with different slopes. In particu- 
lar, the power-law behavior does not extend to very low masses, 
where it displays a turnover or break below typically a few M . 

5.1.2. Lognormal form 

Another widely used functional form for the CMF is the lognor- 
mal, which can be rigorously justified because the central limit 
theorem applied to isothermal turbulence naturally produces a 



6. Fitting the data 

6.1. Evaluating the global CMF 

In this section we apply several statistical methods to analyze 
the CMF of the two regions, and find the best-fit parameters 
for power-law and lognormal models. An important question is 
whether the source sample for which the CMF is constructed 
should undergo more specific selection criteria, such as distance, 
cloud or cluster location, etc. The two SDP fields may be di- 
vided into smaller sub-regions, each containing clumps in var- 
ious stages of evolution as well as already formed stars. Each 
sub-region is in turn characterized by locally different mass dis- 
tributions, which may be fun ctions of the r adial distance from 
the region's center (see, e.g., iDib et al]|2010l) . The mass func- 
tion of clumps in an entire SDP field is thus the sum of all the 
sub-regions and "local" distributions. 

The analysis of the effects of all these local distributions 
on the CMF of a larger (several sq. degrees) region is out of 
the scopes of this work. We do not attempt to separate the 
complete source sample of a given SDP region into somewhat 
smaller sub-samples. Here we limit ourselves to address the dif- 
ferences, if any, between the global CMF of the two regions 
and postpone the study of the aforementioned effects to future 
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Fig. 3. Left panels. Log-log histograms showing the distribution of M for all (black), starless (red) and proto-stellar (blue) clumps 
in the t - 30° (top) and 6 = 59° (bottom) fields. The bin width has been chosen according to the Freedman and Diaconis' rule (see 
Section ROI l. The dashed lines represent the results of fit to the data using the procedure PLFIT. The vertical dashed line corresponds 
to the value of M m f, whereas the solid vertical line shows the 80% completeness limits (see Section [3~T1 and Table|2|. Vertical bars 
show Poisson errors. Right panels. Histograms showing the distribution of clump temperature (color codes are as in the left panels) 
in the Z = 30° (top) and i = 59° (bottom) fields. The bin width has also been chosen according to the Freedman and Diaconis' rule. 
Line styles are as before. 



Table 3. Best-fit parameters (from PLFIT) to the CMF of th e I _ 30° and ( ■■ 
value of the power-law exponent for the IMF is a = 1.35 dSalpeterlll955l) . 



59° fields, for the power-law distribution. The Salpeter 



Population 




e = 30° field 




£ = 59° field 








a 


M m[ 




a 










[M Q ] 






[M Q ] 


All 


1.15 


±0.15 


212 ±79 


1.20 


±0.15 


7.3 ± 


2.2 


Starless 


1.12 


+ 0.18 


206 ± 95 


1.23 


±0.17 


7.5 ± 


2.3 


Proto-stellar 


1.06 


±0.21 


138 ± 105 


0.58 


±0.22 


0.7 ± 


2.5 



work. Thus, while each source has been assigned a specific dis- 
tance, when evaluating the distance effects between the two SDP 
fields we only consider the median distance of each region (see 
Section 17.3. U . Clearly, this also means that the CMF will be 
affected by distance-dependent sensitivity and completeness ef- 
fects. However, this approach will allow us to determine if there 
are any significant differences with previous surveys, where the 
source sample is usually smaller and confined to a single cloud. 



6.2. Fitting the power-law form with the method of maximum 
likelihood 

Various methods exis t to fit a parametric mode l to an astronom- 
ical dataset (see, e.g., Babu & Feigelson 2006). One of the most 
popular methods to analyze the mass spectra of the starless and 
proto-stellar clump populations, consists of placing the masses 
of individual clumps in logarithmically spaced bins, with a lower 
limit on the error estimated from the Poisson uncertainty for each 
bin. The resulting CMF can then be fitted with either a power- 



8 



L. Olmi et al.: On the shape of the mass-function of dense clumps in the Hi-GAL fields 



law or a lognormal function. Then, the resulting best-fit slope, 
a, and the parameters of the lognormal function, fi and cr, usu- 
ally depend somewhat on the histogram binning, and the selected 
mass range, particularly when "small" samples of sources are 
used. 

Alternatively, we hav e considered a method described by 
iBabu & Feigelsonl d2006h and more in details by IClauset et alj 
(120091) . that consists of a statistically principled set of techniques 
that allow fo r the validation and quantification of power-laws. 
According to IClauset et al.l (|2009), this method should be quite 
immune to the significant systematic errors that may affect the 
histogram technique, includi ng those uncertai nties associated 
with the histogram binning. IClauset et al. (2009) have described 
a procedure that implements both the discrete and continuous 
maximum likelihood estimator (MLE) for fitting the power-law 
distribution to data, along with a goodness-of-fit based approach 
to estimating the lower cutoff of the data. Hereafter, we will re- 
fer to this procedure simply as "PLFIT", from the name of the 
main MATLAB0 function performing the aforementioned statis- 
tical operations. The results of the PLFIT method are shown in 
Fig-Sand Table[3] and are discussed in Section|7] 

6.3. Fitting the lognormal form 

Like Section 16.21 also in the case of the lognormal distribution 
we want to avoid the uncertainties inherent in fitting data using 
regression models arising from data binning. The first method 
we describe here is thus based on Bayesian regression tech- 
niques. In particular we have used WinBUGS0, a programming 
language based software that is used to generate a random sam- 
ple (using Markov chain Monte Carlo, or MCMC, methods) 
from the posterior distribution of the parameters of a Bayesian 
mode l. As it is cus tomary in Bayesian regression techniques 
fe.g.. lGregorvll2005l) . once the posterior distributions of the pa- 
rameters of interest have been generated, they can be analyzed 
using various descriptive measures. In Table|4]we show the mean 
values obtained for the fi and cr parameters from 10000 samples 
MCMC runs in WinBUGS. We note that the procedure imple- 
mented in WinBUGS does not currently allow to estimate the 
Mi n f and M sup parameters. In the left panels of Fig. |4]we also 
show the histograms of the ln(M) values, with the solid lines 
representing Guassian fits to the histograms obtained with a stan- 
dard regression technique. Both histograms and fits are shown 
for graphical purposes only. 

While we consider the results from the Bayesian regression 
technique our baseline results, we also want to compare them 
with two alternative methods. The first one is based on the com- 
putation of the PDF of the mass distribution, and is actually de- 
pendent on data binning. To compute the PDF, we plot in the 
right panels of Fig. [4] the normalized counts, i.e., the count per 
bin divided by the product of the total number of data points in 
the sample, N, and the (linear) bin width. For this normaliza- 
tion, the area under the histogram is equal to one, as described 
in Section 15.11 From a probabilistic point of view, this normal- 
ization results in a relative histogram that is most akin to the 
PDF. In addition, the bin width, W, has been chosen according to 
the Freedman and Diaconis' rule (Freedm an & Diaconisl [l981) 
W = 2(IQR)N~ l/3 , where IQR represents the inter-quartile 
range. The best-fit results to the PDF from standard regression 
techniques are then listed in Table 



2 http://www.mathworks.it/ 

3 http://www.mrc-bsu.cam.ac.uk/bugs/welcome.shtml 



The final method we describe here to fit the CMF with a log- 
normal distribution, consists of implementing a MLE method 
for the lognormal function, similar to the one described in 
Section [672l for the power-law distribution. The main advantage 
of the MLE technique, compared to the other methods, is to al- 
low the computation of the M- m f and M sup parameters. Therefore, 
we have numerically maximized the likelihood of the distribu- 
tion in Eq. (TT~3T > as a fu nction of cr and yu, using Powell's method 
(e.g., Press et al. 2002). The results of this method are also listed 
in Table[5](4 th and 5 th column). In this first instance of the MLE 
method for the lognormal distribution, however, the values of 
Mj n f and M sup were arbitrarily fixed to constant values. 

Alternatively, we may consider the values of Mj n f and M sllp 
to be also unknown. These parameters can then be found by 
minimizing the Kolmogorov-Smirnov (KS) statistic between the 
best fit model and the data as a function of M m f and M sup (see 
Appendix IB1. These latter results are listed in the columns 6-9 
of Table 

7. Discussion 

7.1. i = 30° field 

As shown in Section l6T2l because of our large sample of sources, 
we were able to apply the PLFIT procedure not only to the 
whole sample of clumps detected toward the £ = 30° field, but 
also to the starless and proto-stellar clump samples, separately. 
Irrespectively of the specific sample used, we obtained a =s 1 . 1 
(see Fig. [3] and Table |3j. For M; n f, which represents the break 
or turnover below which the distribution does not follow power- 
law behavior, we find M- m f ~ 200 M , except for the sample of 
proto-stellar clumps where M m f = 138 Ms but it has a large un- 
certainty. 

In comparing our results with previous surveys we prefer to 
use the results of ISwift & Beaumont (2010), who applied sim- 
ilar statistical techniques to various (smaller) datasets from the 
literature. In particular, for the power-law functional form, we 
prefer not to compare our best-fit parameters with correspond- 
ing parameters obtained using regression models depending on 
data binning, which are subject to large uncertainties, especially 
when the sample is relatively small (a few hundred sources or 
less). 

Thus, in terms of the power-law functional form, the es- 
timated value of a agrees very well with the typical val- 
ues found by ISwift & Beaumontl d2Q1 0'Pl for both low- and 
high-mass SFRs. On the other hand, the estimated value of 
Mj n f ~ 200 Mq, is h i gher c ompared to the values estimated 
by Swift & Beaumont (2010) for intermediate- and high-mass 
SFRs. We also note the similar values of a and (to a lesser ex- 
tent) Mi n f for the starless and proto-stellar clump samples. 

As far as the lognormal fits are concerned, Table|4]shows that 
the starless population in t — 30° has a slightly higher (lower) 
value of /j. (cr) compared to the proto-stellar population. More 
Hi-GAL fields need to be observed to determine whether this 
is a general property. In terms of the other statistical methods, 
in Table [5] we note that while the results of the MLE and PDF 
methods yield values quite similar to the Bayesian results, the 
values of cr can differ significantly (by almost a factor of 2). 
We also note the relatively small value of M sup , compared to the 
whole range of masses in the t = 30° field. 

It is worth noting that the completeness limit of the t - 30° 
field (see Table |2]l is lower than the Mj n f value. Therefore, the 

4 Their a values correspond to our a + 1 values. 
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Fig. 4. Le/f panels. Histograms showing the distribution of ln(M) for all (black), starless (red) and proto-stellar (blue) clumps in the 
£ = 30° (top) and £ = 59° (bottom) fields. The solid lines represent the results of Gaussian fits to the histograms. The vertical solid 
lines represent the completeness limits as in Fig. [3] Vertical bars show Poisson errors. Right panels. PDFs of the mass distribution 
(black solid line) and best-fit (red dashed line), in the £ = 30° (top) and £ = 59° (bottom) fields. 

Table 4. Mean values, obtained using WinBUGS, from the posterior distributions of the p and cr parameters of the lognormal 
distribution, shown separately for the £ = 30° and £ = 59° fields. 



Population 


£ = 30° field 


6 = 59° field 






cr 




cr 




[In Mq] 


[In M ] 


[In M Q ] 


[In Mq] 


All 


4.58 ± 0.03 


1.57 ±0.03 


0.91 ± 0.02 


1.43 ± 0.02 


Starless 


4.76 ± 0.05 


1.50 ±0.03 


0.92 ± 0.02 


1.42 ±0.02 


Proto-stellar 


4.41 ±0.05 


1.62 ±0.04 


0.89 ± 0.07 


1.54 ±0.04 



peak of the ln(M) distribution in Fig. |4] is real, though barely 
constrained. Likewise, the turnover or break in the CMF of Fig. [3] 
is also effectively observed. However, the fact that the turnover is 
not better constrained may render all later comparisons between 
the power-law and lognormal distributions problematic. 

7.2. £ = 59° field 

As shown in Table [3] (see also Fig. [3} we find that the values 
of a in the £ = 59° field are also approximately equal for the 
whole sample and the starless clumps. These values are consis- 
tent, within the errors, with those found in the £ = 30° field, as 



discussed in Section 17.3.11 However, the a value for the proto- 
stellar clumps is indeed lower than that for the starless sample. 
Given the much smaller number of proto-stellar clumps in the 
£ - 59° field, we need to further investigate whether this could 
be a consequence of statistical uncertainty, or it is instead a true 
property of this region. 

Interestingly, the value of M m f estimated in the £ = 59° re- 
gion is much lower than that of the £ = 30° field, as expected 
given that the £ = 59° field is mostly a low- to intermediate- 
mass SFR. However, contrary to the £ — 30° region, our es- 
ti mated value of M m f is co mparable to the values estimated 
bv lSwift & Beaumont! (l2010h for low-mass SFRs. On the other 
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Table 5. Best-fit parameters to the CMF of the i = 30° and I = 59° fields, for the lognormal distribution and the PDF and MLE 
methods. The PDF best-fits are shown in the right panels of Fig. |4] 



Rej 


;ion 


PDF best-fit 


MLE" 




MLE with KS 








II cr 


fi cr 




cr 


M M 


M sup 






[lnM ] [lnM e ] 


[lnM ] [In Mq] 


[In Mq] 


[In Mq] 


[Mq] 


[Mq] 


e = 


30° 


4.5 1.2 


4.5 2.3 


4.7 


2.9 


20 


394 


i = 


59° 


0.54 1.2 


1.1 2.6 


0.53 


1.9 


0.5 


10.8 



Notes. We arbitrarily chose M inf = 1 M Q and M sllp = 500 M Q for the I = 30° region, and M inf = 0. 1 M G and M sup = 60 M e for I = 59°. 



hand, while ISwift & Beaumont] (|2010) report a wide range of 
CMF slopes (corresponding to a ~ 0.7 - 3.1), our estimated val- 
ues of a do not appreciably vary from the I = 30° to the ( = 59° 
field. 

In terms of the lognormal functional form of the CMF to- 
ward the { = 59° field, the results shown in Fig. [4] and Tables [4] 
and[5]confirm that the mass range in the two SDP fields are quite 
different (see Table 0. As in the ( = 30° region, Table [5] shows 
that the values of cr obtained through the MLE and PDF meth- 
ods can significantly differ from the Bayesian results. We have 
yet to determine if this is a feature associated with the different 
methods. 

Finally, the completeness limit of the I = 59° region is quite 
lower than its corresponding M m f value. Therefore, the peak of 
the ln(M) distribution in Fig.|4]is at least partially resolved, and 
the detection of a turnover in the CMF is more reliable than it is 
in the £ = 30° field. 

7.3. Comparing the two SDP fields 
7.3.1. Distance effects 

Comparing now the results for the I = 30° and I = 59° fields, 
we have already noted how the values of a are similar for the 
two regions. Therefore, if the evolution toward the IMF of high- 
mass clumps {( = 30°) is different from those of low- and 
intermediate-mass clumps (I = 59°), it leaves no trace on the 
shape of the CMF of these two regions. This conclusion, how- 
ever, is critically dependent on the correcteness of the distance 
estimates, and in any case it is the result of a survey over a 
large collection of different SFRs. In fact, our findings in the 
I = 30° field are different from those of iNetterfield et all (2009) 
who found a different slope of the CMF for the cold and warm 
populations of clumps in the Vela-C region, which is a less het- 
erogeneous region compared to our field. 

We have also seen that the values of p. and M m f are clearly 
different for the two regions (see Tables [4] and [5]). However, and 
despite the uncertainty on the absolute value of cr, when com- 
paring the results obtained with the same regression technique in 
the two SDP fields, we find that the values of cr are remarkably 
similar. We can also estimate the average values from all four 
methods used, and we find (cr} = 2.0 ± 0.8 and 1.8 ± 0.6 [In M Q ] 
for ( = 30° and t = 59°, respectively. These average values are 
also c onsistent with the range of cr found bv lSwift & Beaumont! 
d2010D . ~ 0.7 to 3.4, who also found the variation in the values 

of n ( 5 .4 to 2.7) larger compared to that of cr. Therefore, the 

histograms representing the distribution of the ln(M) values in 
the two SDP fields (Fig.|4]i are characterized by decidedly differ- 
ent mass scales, but are quite similar in shape, as it can be noted 
in the left panels of Fig. [4] 

This similarity can be noted even more clearly in the top 
panel of Fig. [5] where the histogram profiles from Fig. |4j for 
the ( = 30° and t = 59° fields, are shown side by side. We also 




0.1 1.0 10.0 100.0 

M [M ] 

Fig. 5. Top. Histogram profiles from Fig. |H with solid and 
dashed lines representing the I = 30° and ( = 59° fields, respec- 
tively. The dash-dotted line represents the distribution obtained 
for the t = 59° field traslated to the distance of ( = 30° (see text). 
Bottom. CMF profiles from Fig. [3} line styles are as before. The 
vertical lines mark the completeness limits listed in Table [2] for 
the I = 30° (solid line) and t = 59° (dashed line) fields. 



compare the CMFs in the bottom panel of Fig. [5] Apart from dif- 
ferences due to binning, one can clearly see that the mass distri- 
butions are very similar, although this similarity should be taken 
as far as the completeness limits actually allow to. 

Thus, although a full comparison of the mass spectra, includ- 
ing the mass range below the peak of the ln(M) distribution, will 
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require higher sensitivities than those achieved in the two SDP 
fields, the existence of a mass scale difference between the two 
regions is clear. It seems unlikely that the different mass scales 
could be due entirely to differences in the median distance (see 
Tab.[2]i, since this could account for a factor ~ 4 at most in mass 
sensitivity. 

To test this issue more quantitatively, we have performed a 
simulation where the Z = 59° field is traslated to the same dis- 
tance as the t = 30° region. In this procedure, the angular reso- 
lution in all of the original maps is first degraded by convolving 
with a beam in each band enlarged for the increased distance, 
and then the maps are re-binned accordingly (A. Facchini, priv. 
comm.). The resulting maps are run through the source extrac- 
tion and SED-fitting pipelines, and the extracted SED parame- 
ters are estimated by scaling the distance of each source by the 
ratio of the median distances in the Z = 30° and Z = 59° fields. 

Our test thus shows that the mass of the sources in the 
traslated maps can increase due to the larger distance and, to a 
lesser extent, due to the "merging" of sources that are detected as 
separate objects in the original maps of the Z = 59° region. The 
new mass distribution is shown in Fig.[5j where we can clearly 
see that the CMF for the traslated t = 59° region lies between the 
original CMFs of the t = 59° and Z = 30° fields. This appears 
to confirm our earlier assumption that distance effects alone can- 
not explain the overall difference in mass scales between these 
two specific SFRs. We thus think that this mass scale effect is 
evidence that the overall process of star formation in the two 
regions must be radically different. 

7.3.2. Clump formation efficiency 

It is too early to speculate about the physical origins of the dif- 
ferences described in the previous section. In fact, we have ex- 
ploited only a few percent of the Hi-GAL survey in the present 
analysis. With more and more regions being analyzed we expect 
to find statistically significant indications that there may indeed 
exist different Galactic star-forming regimes, as stated in the Hi- 
GAL scientific goals. For the present analysis, the prominence 
of the ( = 30° region in star-f orming indicators, compared to 
Z = 59° dBattersbv et alj |201 ll) including the (triggered) W43 
"mini-starburst" complex dBallv et al.l l2010). and the mass scale 
difference we found between the two regions, could all be related 
with the Z = 30° field being located near the interaction region 
between one end of the Galactic bar and the Scutum spiral arm, 
where high concentrations of shocked gas are more like ly to be 
found dGarzon et al ] fl997L lLopez-Corredoira et al]fl999h . 

In order to further test this scenario, we have estimated an 
additional figure of merit, the clump formation efficiency (CFE), 
that we define as: 

CFE = (16) 

^^clumps "t" ^^clouds 

where M c i umps represents the total mass of the clumps (above 
completeness limit) and M c i ou d s is the mass of the ambient gas. 
Given that the two SDP fields represent a collection of regions, 
possibly at different distances, rather than being a single molec- 
ular cloud at a single distance, two obvious problems must be 
considered. First off, we have to select a method to map the to- 
tal column density, without being limited by threshold, opacity 
and temperature variations in the ambient gas/dust. Second, the 
dis tance to separate parcels of gas must be estimated. 

iGoodman et alj d2009h have discussed and compared sev- 
eral methods for measuring column density in molecular clouds 



and they conclude that dust extinction is likely the best probe. 
Therefore , we have downloaded the A v e xtinction mapjj ob- 
tained by iRowles & Froebrichl d2009l) and iFroebrich & Rowlesl 
d201Q ) and used them to estimate the total column density in the 
two SDP regions. Specifically, we have selected only those pix- 
els with A v > 1 and we have assigned them a distance based 
on the closest (within a 3 arcmin radius) mass clump previously 
identified. Then, the extinction in each pixel has been converted 
to column density using the conversio n factor Nh = P\ A y cm~ 2 
where /3 V = 2 x 10 21 cnr 2 mag^ 1 dSavage & Mathisl 1 19791) . 
Therefore, the total mass in the clouds has been calculated as: 

Mciouds = AQ/^v m m J] d\k\ (17) 
i 

where AQ is the solid angle subtended by each pixel in the ex- 
tinction maps, p - 1.38 takes into account the cosmic He abun- 
dance, me2 is the mass of molecular hydrogen, and c/; and A\ rep- 
resent the distance and visual extinction toward the /-th pixel. 

Our estimated CFE thus amounts to 2.4 % and 0.7 % for 
the / = 30° and I = 59° regions, respectively. Although the deter- 
mination of the CFE is subject to several uncertainties (mainly 
due to the distance estimates), this is an indication that clumps 
may indeed form more efficiently in the / = 30° rather than in the 
I = 59° field. However, since the relationship of the CFE with 
the star formation efficiency {S FE) is not known, our present 
analysis is not yet conclusive that the S FE is also higher in the 
I = 30° field. 

7.4. Comparing the CMF to the IMF 

The qualitative similarity, observed in past studies, between the 
CMF and the IMF offers support for the accepted idea that stars 
form from dense clumps, and thus comparing the two distribu- 
tions should allow us to learn how observed samples of clumps 
evolve into stars. This comparison is actually a complex task be- 
cause CMFs are often different but the IMF appears to be quite 
universal. In addition, it is difficult to understand whether the 
CMFs of different regions are intrinsically different, or to what 
degree systematic differences in each dataset (either observa- 
tional or from post-processing) may contribute to the variations 
see n from dataset to dataset. 

ISwift & Williams! (|2008) have shown that different evolu- 
tionary pathways from clumps to stars produce variations in 
the form of the resultant IMF. They also showed that while 
the power-law slope is quite robust, the width of the lognor- 
mal distribution is a more sensitive indicator of clump evolution. 
As we showed earlier, the average value of cr in the two SDP 
fields is ~ 1.9 [In My,], c onsistent with the range of <x found by 
Swift & Beaumont (2010), and with an uncertainty of as much 
as 40%. However, the width of the IMF has been measured to be 
narrower, between 0. 3 and 0.7 [In M a ] (e .g., Chabrier 2003j). As 
already suggested by ISwift & Beaumo nt (2010) this would ap- 
pear to indicate that an additional mass selection occurs in later 
stages of gravitational collapse. 

8. Conclusions 

We have analysed two fields mapped by the SPIRE and PACS 
instruments of HSO during its science demonstration phase. The 
two fields, which are part of the Herschel infrared GALactic 
Plane Survey, were centered at / = 30° and / = 59° and the 
final maps covered almost lOdeg 2 of galactic plane. 

5 http://astro.kent.ac.uk/extinction/ 
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The two regions underwent a source-extraction and flux- 
estimation pipeline, which allowed us to obtain a sample with 
thousands of clumps. We then applied several statistical methods 
to analyze the resulting CMFs, and found the best-fit parameters 
for power-law and lognormal models. No attempt was made to 
select more uniform sub-samples, except for starless and proto- 
stellar clumps. Our main conclusions are the following: 

- Our best-fit parameters for the power-law distribution show 
a robust slope (a =s 1.2, with a ~ 15% uncertainty) when 
comparing the two SDP fields. In contrast, we find a very 
different value of the parameter M;„/ for the two regions 
analyzed. We find that M; n f is higher than the completeness 
limit in each region and is thus well defined. 

- We have used several statistical techniques to estimate the 
best-fit parameters of the lognormal functional form. For 
each separate method, the values of the width, cr, in the 
two SDP fields are remarkably similar. The average values, 
(cr) = 1.9 ±0.8 and 1.8+0.6 [In Af ] for £ = 30° and I = 59°, 
respectively, are also very similar. Like M m f, the value of 
the characteristic mass, p., is very different in the two regions. 

- The similarity of a and cr on one side, and the difference of 
Mjnf and fi on the other, show that the CMFs of the two SDP 
fields have very similar shapes but different mass scales 
which, according to our simulations, cannot be explained by 
distance effects alone. This represents an evidence that the 
overall process of star formation in the two regions is very 
different. 

- The similarity of the shape of the CMF in the two SDP 
regions suggests that if the evolution toward the IMF of 
high-mass clumps (£ = 30°) is different from those of low- 
and intermediate-mass clumps (t = 59°), it leaves no trace 
on the shape of the CMF. 

- The width of the IMF is narrower than the measured values 
of cr in the two SDP fields. This suggests that an additional 
mass selection occurs in later stages of gravitational col- 
lapse. 



Appendix A: Normalization of the lognormal Mass 
Function 

By applying the normalization condition (O to the lognormal 
MF in Eq. (fTTb . we get: 



f 



■ln(M sup ) a 



ln(M inr ) 



V27TO- 



exp 



(In M-fi) 2 



2cr 2 



din M 



= N, r 



which by changing the variable of integration can be easily trans- 
formed into: 



exp(-x )dx = N tc 



(A.l) 



where we have defined the variable x(M) = (In M - //)/( V2<x), 
and jq n f = x(M; n f) and x sup = jc(M sup ). Then, by using the fol- 
lowing relation 



I exp(-* )dx + I exp(-x )dx + 

Jo ~ Jx, 



f 



+ I exp(-;r)dx = I exp(-x )dx = 



■X2 JO 

which converts into: 



r 



exp(-x 2 )dx = [1 - erf(jti) - erfc(jci)] = 



(A.2) 



[erfc(xi) - erfc(x2)] 



we can write Eq. dA.ll ) as: 

~Y [l - erf(^i n f) - erfc(.x sup )] = 



(A.3) 



where the erf and erfc are defined as: 



2 r x 

erf(x) = — — I exp(-f z )df and 
Jn Jo 



erfc(x) = 1 — erf(x) 



2 r 

V^r Jx 



exp(-r )df . 



Finally, we can write the normalization constant, A;„, as: 

Ai„ = 2Mot x (A.4) 
,^ (In M inf - nf \ crfc / (In M sup - nf 



2o- 2 



2cr 2 



Appendix B: Estimating the M inf and M sup 
parameters 

The most common and easiest ways of choosing the M- m f and 
M sup parameters (for both the power-law and lognormal func- 
tional forms) are either to take the minimum (above the com- 
pleteness limit) and maximum values of the mass range in the 
dataset, or to plot a histogram of M and choose M m f and M sup 
based on a (arbitrary) threshold occupancy for the bins. A more 
robust approach is desirable. In the method we use here, we 
choose the values of M- m f and M sup that make the probability dis- 
tributions of the observed data and the best-fit lognormal model 
as similar as possible in the range |M; n f, M sup J dClauset et al.l 
I2009h . 

There are a variety of measures for quantifying the dis- 
tance between two probability distributions, and following 
IClauset et al.1 (120091) we choose the Kolmogorov-Smirnov (or 
KS) statistics, which is simply the maximum distance between 
the CCDFs (see Section FSTTb of the observed data, P° bs (M), and 
the fitted model, P™ d (M): 



D = max, 



|P° bs (M) - P™ d (M)\ 



(B.l) 



The procedure that implements this method thus follows four 
basic steps: 

- (1) choose Mi n f and M sup from selected (arbitrary) intervals; 

- (2) calculate the MLE values of fi and cr using Powell's 
method; 

- (3) apply the KS statistic to the interval [M; n f , M sup J and es- 
timate D; 

- (4) go back to step (1) and keep exploring the (M; n f,M sup ) 
space. 

At the end of this procedure, we choose the values of fi, cr, M- in f 
and M sup that minimizes D. N tot is thus the total number of ob- 
jects with mass in the range yM^f , M sup J. 

Acknowledgements. We thank Sean Carey for making the 24 yum catalog avial- 
able to us prior to its official release. 



13 



L. Olmi et al.: On the shape of the mass-function of dense clumps in the Hi-GAL fields 

References Zhang, Q., Hunter, T. R., Brand, J., et al. 2005, ApJ, 625, 864 

Alves, J., Lombardi, M., & Lada, C. J. 2008, A&A, 462, L17L21 
Andre, P., Men'shchikov, A., Bontemps, S., et al. 2010, A&A, 518, L102 
Babu, G. J. & Feigelson, E. D. 2006, in Astronomical Society of the Pacific 

Conference Series, Vol. 351, Astronomical Data Analysis Software and 

Systems XV, ed. C. Gabriel, C. Arviset, D. Ponz, & S. Enrique, 127 
Bally, J., Anderson, L. D., Battersby, C, et al. 2010, A&A, 518, L90 
Barnard, V. E., Vielva, P., Pierce-Price, D. P. I., et al. 2004, MNRAS, 352, 961 
Battersby, C, Bally, J., Ginsburg, A., et al. 201 1, A&A, 535, A128 
Beltran, M. T., Brand, J., Cesaroni, R., et al. 2006, A&A, 447, 221 
Beuther, H., Schilke, P., Menten, K. M., et al. 2002, ApJ, 566, 945 
Billot, N., Noriega-Crespo, A., Carey, S., et al. 2010, ApJ, 712, 797 
Billot, N., Schisano, E., Pestalozzi, M., et al. 201 1, ApJ, 735, 28 
Blum, R. D., Damineli, A., & Conti, P. S. 1999, AJ, 117, 1392 
Carey, S. J., Noriega-Crespo, A., Mizuno, D. R., et al. 2009, PASP, 121, 76 
Cesaroni, R., Hofner, P., Walmsley, C. M., & Churchwell, E. 1998, A&A, 331, 

709 

Chabrier, G. 2003, PASP, 115, 763 
Chapin, E. et al. 2008, ApJ, 681, 428 

Chen, X., Arce, H. G., Zhang, Q., et al. 2010, ApJ, 715, 1344 

Clauset, A., Shalizi, C. R., & Newman, M. E. J. 2009, SIAM Review, 51, 661 

De Luca, M., Giannini, T., Lorenzetti, D., et al. 2007, A&A, 474, 863 

Dib, S., Shadmehri, M., Padoan, P., et al. 2010, MNRAS, 405, 401 

Elia, D., Schisano, E., Molinari, S., et al. 2010, A&A, 518, L97 

Elmegreen, B. G. 1985, in Birth and Infancy of Stars, ed. R. Lucas, A. Omont, 

& R. Stora, 257-277 
Enoch, M. L., Evans, II, N. J., Sargent, A. I., & Glenn, J. 2009, ApJ, 692, 973 
Enoch, M. L., Evans, II, N. J., Sargent, A. I., et al. 2008, ApJ, 684, 1240 
Enoch, M. L. et al. 2006, ApJ, 638, 293 

Freedman, D. & Diaconis, P. 1981, Probability Theory and Related Fields, 57, 
453476 

Froebrich, D. & Rowles, J. 2010, MNRAS, 406, 1350 

Garzon, F, Lopez-Corredoira, M., Hammersley, P., et al. 1997, ApJL, 491, L31 

Goodman, A. A., Pineda, J. E., & Schnee, S. L. 2009, ApJ, 692, 91 

Gregory, P. C. 2005, Bayesian Logical Data Analysis for the Physical 

Sciences: A Comparative Approach with 'Mathematica' Support (Cambridge 

University Press) 
Griffin, M. J., Abergel, A., Abreu, A., et al. 2010, A&A, 518, L3 
Gutermuth, R. A., Myers, P. C, Megeath, S. T., et al. 2008, ApJ, 674, 336 
Harvey, P. M., Chapman, N, Lai, S.-R, et al. 2006, ApJ, 644, 307 
J0rgensen, J. K., Harvey, P. M., Evans, II, N. J., et al. 2006, ApJ, 645, 1246 
J0rgensen, J. K., Johnstone, D., Kirk, H., & Myers, P. C. 2007, ApJ, 656, 293 
J0rgensen, J. K., Johnstone, D., Kirk, H., et al. 2008, ApJ, 683, 822 
Konyves, V, Andre, P., Men'shchikov, A., et al. 2010, A&A, 518, L106 
Lopez-Corredoira, M., Garzon, F, Beckman, J. E., et al. 1999, AJ, 118, 381 
Martin, P. G., Roy, A., Bontemps, S., et al. 2012, ApJ, 751, 28 
Massey, P., Johnson, K. E., & Degioia-Eastwood, K. 1995, ApJ, 454, 151 
McKee, C. F. & Ostriker, E. C. 2007, ARA&A, 45, 565 
Mezger, P. G., Zylka, R., & Wink, J. E. 1990, A&A, 228, 95 
Molinari, S., Schisano, E., Faustini, F, et al. 2011, A&A, 530, A133 
Molinari, S., Swinyard, B., Bally, J., et al. 2010a, A&A, 518, L100 
Molinari, S., Swinyard, B., Bally, J., et al. 2010b, PASP, 122, 314 
Mooney, T., Sievers, A., Mezger, P. G., et al. 1995, A&A, 299, 869 
Motte, F, Andre, P., & Neri, R. 1998, A&A, 336, 150 
Netterfield, C. B., Ade, P. A. R., Bock, J. J., et al. 2009, ApJ, 707, 1824 
Nguyen Luong, Q., Motte, R, Schuller, R, et al. 201 1, A&A, 529, A41 
Nutter, D. & Ward-Thompson, D. 2006, MNRAS, 368, 1833 
Olmi, L., Ade, P. A. R., Angles-Alcazar, D., et al. 2009, ApJ, 707, 1836 
Pascale, E., Ade, P. A. R., Bock, J. J., et al. 2008, ApJ, 681, 400 
Poglitsch, A., Waelkens, C, Geis, N, et al. 2010, A&A, 518, L2 
Press, W. H., Teukolsky, S. A., Vetterling, W. T, & Flannery, B. P. 2002, 

Numerical recipes in C++ : the art of scientific computing, ed. Press, W. H., 

Teukolsky, S. A., Vetterling, W. T., & Flannery, B. P. 
Robitaille, T. P., Whitney, B. A., Indebetouw, R., Wood, K., & Denzmore, P. 

2006, ApJS, 167, 256 
Rowles, J. & Froebrich, D. 2009, MNRAS, 395, 1640 
Roy, A., Ade, P. A. R., Bock, J. J., et al. 201 1, ApJ, 727, 1 14 
Russeil, D., Pestalozzi, M., Mottram, J. C, et al. 2011, A&A, 526, A151 
Sadavoy, S. I., Di Francesco, J., Bontemps, S., et al. 2010, ApJ, 710, 1247 
Salpeter, E. E. 1955, ApJ, 121, 161 
Savage, B. D. & Mathis, J. S. 1979, ARA&A, 17, 73 

Shenoy, S. S., Carey, S. J., Noriega-Crespo, A., & others. 2012, ApJ, submitted 
Solomon, P. M., Rivolo, A. R., Barrett, J., & Yahil, A. 1987, ApJ, 319, 730 
Swift, J. J. & Beaumont, C. N. 2010, PASP, 122, 224 
Swift, J. J. & Williams, J. P. 2008, ApJ, 679, 552 
Testi, L. & Sargent, A. I. 1998, ApJL, 508, L91 

Williams, J. P., Blitz, L., & McKee, C. F. 2000, Protostars and Planets IV, 97 



14 



